Diversely evolved xibalbin variants from remipede venom inhibit potassium channels and activate PKA-II and Erk1/2 signaling

Background The identification of novel toxins from overlooked and taxonomically exceptional species bears potential for various pharmacological applications. The remipede Xibalbanus tulumensis, an underwater cave-dwelling crustacean, is the only crustacean for which a venom system has been described. Its venom contains several xibalbin peptides that have an inhibitor cysteine knot (ICK) scaffold. Results Our screenings revealed that all tested xibalbin variants particularly inhibit potassium channels. Xib1 and xib13 with their eight-cysteine domain similar to spider knottins also inhibit voltage-gated sodium channels. No activity was noted on calcium channels. Expanding the functional testing, we demonstrate that xib1 and xib13 increase PKA-II and Erk1/2 sensitization signaling in nociceptive neurons, which may initiate pain sensitization. Our phylogenetic analysis suggests that xib13 either originates from the common ancestor of pancrustaceans or earlier while xib1 is more restricted to remipedes. The ten-cysteine scaffolded xib2 emerged from xib1, a result that is supported by our phylogenetic and machine learning-based analyses. Conclusions Our functional characterization of synthesized variants of xib1, xib2, and xib13 elucidates their potential as inhibitors of potassium channels in mammalian systems. The specific interaction of xib2 with Kv1.6 channels, which are relevant to treating variants of epilepsy, shows potential for further studies. At higher concentrations, xib1 and xib13 activate the kinases PKA-II and ERK1/2 in mammalian sensory neurons, suggesting pain sensitization and potential applications related to pain research and therapy. While tested insect channels suggest that all probably act as neurotoxins, the biological function of xib1, xib2, and xib13 requires further elucidation. A novel finding on their evolutionary origin is the apparent emergence of X. tulumensis-specific xib2 from xib1. Our study is an important cornerstone for future studies to untangle the origin and function of these enigmatic proteins as important components of remipede but also other pancrustacean and arthropod venoms. Supplementary Information The online version contains supplementary material available at 10.1186/s12915-024-01955-5.

Only one venomous species of marine crustaceans has been described so far in more detail [26,27].Xibalbanus tulumensis belongs to the crustacean class Remipedia, which was first described over 40 years ago (Yager 1981) and currently comprises 28 extant species [28,29].However, the internal relationships of remipedes remain challenging [6,[26][27][28].Phylogenomic analyses show that remipedes share a common ancestor with hexapods, making them a key taxon for comprehending insect evolution [30][31][32][33][34].The biology and ecology of remipedes are not yet comprehensively understood, likely due to the extraordinary and secluded environment they inhabit as stygobionts in the marine saltwater regions of anchialine underwater cave systems [35].
The venom system of X. tulumensis and its anatomy has been studied using synchrotron-based µ-computer tomography in the first comprehensive publication about remipede venom [26]; see Fig. 1.
Remipedes have large thoracic glands connected to reservoirs in their second maxillules, which release venom through an apical pore under a terminal claw [36].This venom injection is facilitated by complex muscle arrangements [26,36], though it is worth noting that direct observations of remipedes catching prey are lacking.Transcriptomics identified putative venom components, later detailed in the first proteo-transcriptomics study using squeezed-out gland lumen proteome [27]; see Fig. 1.Three main groups of peptides and proteins were found: enzymes, including chitinase, peptidase S1, and LDLa-domain containing peptidase S1; moderately expressed non-enzymatic proteins; and novel peptides (xibalbins).Many of these peptides resemble inhibitor cysteine knot peptides (ICKs or knottins), known for their robustness against enzymes, heat, and pH due to their characteristic cysteine scaffold that results in specific numbers of disulfide bridges [37,38].In various animal venoms, ICK peptides function as neurotoxins, hemolytics, or antibacterials, and they are also explored for pharmacological and agrochemical applications [3,39].
The hypothesized mechanism of envenomation by remipedes, based on proteo-transcriptome data sequence similarities [27], suggests that xibalbin1 (xib 1 ), xibalbin2 (xib 2 ), and xibalbin13 (xib 13 ) ICK-like protein families, and the double-ICK-like xibalbin3 (xib 3 ), could act as putative neurotoxins that rapidly cause paralysis in prey during an attack.Simultaneously, the proteins and enzymes break down internal tissues and structures, resulting in prey liquefaction and subsequent feeding by remipedes [27].However, it should be noted that bioactivity tests for the venom compounds of remipedes, particularly the ICK-like peptides, have been lacking so far.
In this study, we investigate the bioactivity of synthetic variants of xib 1 , xib 13 , and xib 2 focused on possible application potential, examining their cytotoxicity (including cancer cell lines), ability to modulate ion channels, and impact on sensory neurons.Our research explores also insight into their biological functions by testing insect targets.Furthermore, we shed first light on the diversity and origin of ICK-like peptides in remipedes by including sequences from further species besides X. tulumensis using phylogenetic and machine learning approaches.

Results
Xib 1 is unique in remipedes while xib 13 is similar to other ICKs, whereas xib 2 and xib 3 are specific to Xibalbanus To broaden the scope of our study beyond the single species X. tulumensis, we examined potential variants of xib 1 , xib 2, xib 13, and xib 3 (double ICK, not tested in this study) in four other remipedes.This analysis was based on de novo assembled transcripts from X. tulumensis from which secreted proteins were identified proteomically [27]; see Figs. 1 and 2. Additionally, we used transcriptomes of whole animals, including venom systems, from Lasionectes entrichoma, Morlockia williamsi, Godzillignomus frondosus, and Pleomothra apletocheles that have been published [33,40].To identify venom proteins in these published data, an automated search pipeline [41] was employed utilizing hmmerbased identification resulting in final alignments of xibalbins (Fig. 2); see Methods for further details.
Transcripts of xib 1 were found in four of the five remipede transcriptomes, exhibiting higher sequence variation than xib 13 .Both share an identical scaffold with eight cysteines in the mature peptide (see Fig. 2).Similar sequences to xib 2 or xib 3 (which feature a double ICKlike domain) were not recovered in the four other remipedes, except for X. tulumensis (see Fig. 2).No other ICK groups with different scaffolds were found, such as the known six-cysteine backbones from insects or cone snails [26,27].The mature sequences for chemical synthesis were predicted using the xibalbin alignments including all transcripts from the five available remipede species; see Fig. 2.Only transcripts with signal peptides described in the proteo-transcriptomic study [27] were considered.The transcripts obtained from the four reassembled novel remipede transcriptomes (besides X. tulumensis) guided us additionally to identify mature sequences of representative peptide variants; see Fig. 2 and "Methods." The xibalbin 2 variants that we name xib 2a and xib 2b derive from the same sequence but differ in two ambiguously predicted propeptide cleavage sites (Fig. 2).All xibalbin variants were produced synthetically by nonselective refolding; for details, see "Methods." It has to be noted that the purity of the products differs (Xib 1 : 88.0%, xib 12 : 84.9% purity, xib 2a : 99.9%, xib 2b : 74.4%) and that the purity < 75% for xib 2b is less ideal for bioactivity tests; see Additional File 1: Figure S1.

Xib 1 , xib 2 , and xib 13 strongly inhibit voltage-gated potassium channels and mildly inhibit selected sodium channels while they show no inhibition of calcium channels
We tested the xibalbin variants on a broad selection of voltage-gated potassium (Fig. 3), sodium (Fig. 4), and calcium channels (Fig. 5).The relevant channels were exogenously expressed in Xenopus laevis oocytes, and their activities were measured using the two-electrode voltage clamp technique.
Xib 1 and xib 13 were found to exhibit a potent inhibition on voltage-gated potassium channels (Kvs) and a mild inhibition on voltage-gated sodium channels (Nav).In contrast, xib 2a and xib 2b displayed a preference for a limited range of Kvs.Xib 2a specifically targeted the Nav1.4 isoform with a high degree of selectivity, while xib 2b inhibited solely Kvs.Notably, no detectable activity was observed on T-type calcium channels Cav3.1, Cav3.2, and Cav3.3 (Fig. 6).
Furthermore, the peptide's impact on Nav channels was assessed by analyzing the current-voltage relationships of xib 1 , xib 13 , and xib 2a , providing insights on its ion channel blocking mechanisms (Fig. 7).The data at V half indicates that the peptides serve primarily as pore blockers for the majority of Nav isoforms since most blocked Nav channels did not present a notable shift in their activation and steady-state inactivation curves, when compared to control (Tables 1 and 2).
Xib 2a solely inhibited the Nav1.4 isoform (Fig. 7 C) and significantly altered its V half of activation (− 23.0 ± 0.1 mV to − 17.6 ± 0.1 mV), with no notable shift of the V half of inactivation (− 44.6 ± 0.2 mV to − 44.9 ± 0.3 mV).Notably, these data suggest that besides interacting with the pore region of these Nav isoforms, by reducing the flow of ions through the channel, xibalbins may also interact with the voltage sensor domain of some Nav isoforms (Figs. 6 and  7).Nonetheless, additional studies are needed to fully elucidate their precise mechanism of action.

Xib 1 , xib 13 , and xib 2 do not show overt cytolytic or cytotoxic activity
Having characterized the primary structure of xibalbins and having tested them on selected ion channels, we aimed to determine the biological activity of xibalbins on  S1 for individual data values the primary cells of adult male rats.The diverse interactions with voltage-gated ion channels suggest that xibalbins have effects not only on electrically active cells such as neurons and cardiomyocytes but also on kidney cells.Consistent with previous studies on other ICK toxins, we evaluated the activity of xibalbins on cultured sensory neurons [42][43][44].
We initially determined the cytotoxic effects of xibalbins on primary sensory neurons from adult male rats cultured in vitro [2].Sensory neurons exhibit increased sensitivity to changes in size or loss of attachment following exposure to cytotoxic levels of calcium [45].We exposed overnight cultures of dissociated rat dorsal root ganglions (DRGs) to xibalbins for 5 and 30 min.Subsequently, all cells were fixed and immunocytochemically stained for the neuronal marker UCHL1 to identify neurons.The entire culture was digitally scanned using high content imaging (HCI) microscopy, and cell numbers, UCHL1 staining per cell, and cell size was analyzed.No loss of neurons was induced by either of the tested xibalbins, even at the highest concentration of 0.8 µg/µL (Additional File 3: Figure S2).Additionally, no differences were observed in the size distribution (Additional File 4: Figure S3 A, C) or the staining of the neuronal marker UCHL1 (Additional File 4: Figure S3 B, D).We further investigated the potential toxicity of xibalbins in monocyte/macrophage-like cells (RAW264.7) and human microvascular endothelial cells (HMEC); see Additional File 5: Figure S4 The viability of RAW264.7 cells was detected using a formazan-based assay, while the proliferation of HMECs was assessed by measuring the number of crystal violet-stained cells.The tests did not detect any significant cytotoxic activity of the tested xibalbins.Knottins have been demonstrated to modify neuronal activity by acting on voltage-gated ion channels.To gauge sensory neuron activity in response to various activating stimuli, including electrical activity, phosphorylation state detection of protein kinase A type II (PKA-II) and MAP kinase Erk1/2 can act as surrogate measurements [42,[46][47][48].Consequently, we tested for increased phosphorylation states in a concentration-dependent manner following exposure to xibalbins.Dissociated DRG neurons were cultured overnight and then exposed to increasing concentrations of the corresponding xibalbins for 5 min and 30 min, respectively.As a positive control, forskolin (Fsk) was also used to induce cAMP synthesis [49,50].The activity of PKA-II was monitored by antibodies directed against the phosphorylation site of the inhibitory regulatory subunits RIIα/β, which is exclusively accessible when the catalytic kinase domain is released during kinase activation [42].For measuring Erk1/2 activity, phospho-sites on Erk1/2 (T202/ Y204) were monitored through immunofluorescence, as these sites are phosphorylated during activation [51,52].Cellular images were captured using HCI microscopy (Fig. 8A), and the average intensity of each phospho-antibody was quantified (Fig. 8B).
At the highest tested concentration of 0.8 µg/ µL(154 µM), xib 1 caused a significant increase in pErk1/2 intensity by 2.17-fold (q = 4.07); see Fig. 8F.However, no changes were observed in the intensity of either phospho-PKA-II or pErk1/2 with xib2a and xib2b exposure at either of the two exposure times (Fig. 8C-G).

Xibalbins induce Erk1/2 and PKA-II activation in primary nociceptive and non-nociceptive sensory neurons
Cultures of primary sensory neurons consist of a variety of different neuron subtypes.This includes large-sized non-nociceptive and small-sized nociceptive neurons.To characterize if xibalbins activate either one or both neuron subtypes, we analyzed the response according to the cell size.Forskolin used as a positive control to increase phospho-PKA-II activity did not show a prevalence and activated both small and large-sized sensory neurons (Fig. 8G, H).Upon testing with the highest concentrations, xib 1 and xib 13 also showed increased phosphorylation signals in both small-sized and large-sized sensory neurons (Fig. 8, H).Therefore, it can be concluded that xib 1 and xib 13 act on both nociceptive and non-nociceptive neurons.

No signaling induced by pain-inducing mediators
Xibalbins that modulate sensitization signaling in nonnociceptive and nociceptive sensory neurons suggest the involvement of these toxins in sensation.Therefore, we conducted further tests on pain-related cellular activity including also non-neuronal cells such as HEK293, RAW264.7 macrophage cell line, and leukocytes.Paininducing mediators such as bradykinin, prostaglandin E2, LPS, and TNF act by, e.g., increasing Ca 2+ influx through calcium channels, leading to a significant increase in intracellular calcium ([Ca 2+ ] i ), by an increase of cyclic adenosine monophosphate (cAMP) synthesis, and/or by  (n ≥ 3) ± SEM; SEM standard error of the mean.Statistics were calculated using one-way ANOVA, followed by Tukey's multiple comparisons test.Differences were considered statistically significant when p < 0.01; see Tables 1  and 2. See "Methods" and Additional File 2: Table S1 for individual data values induction of NO synthesis [53].Such elevated levels of intracellular Ca 2+ , cAMP, or NO can contribute directly and indirectly to an increase in neural activity.Consequently, this can lead to a heightened perception of pain, which is relayed to the central nervous system via direct and indirect pathways.
We examined the potential of xibalbins in a concentration range of 0.25 to 25 µg/ml regarding their effects on Ca 2+ influx in HEK293 cells, which express calcium channels.Forskolin, utilized as the positive control, increased [Ca 2+ ] i in HEK293 cells [54].However, exposure to none of the four xibalbin variants by itself did increase [Ca 2+ ] i as measured by Fluo-8 calcium imaging assay.Additionally, none of the xibalbin variants modulated forskolininduced [Ca 2+ ] i increase (Additional File 7: Figure S5  A).We also evaluated the effect of xibalbins on cAMP synthesis.However, xibalbins had no effect on increasing cAMP synthesis in HEK293 cells or on preventing the forskolin-induced cAMP synthesis (Additional File 7: Figure S5 B).
Nitric oxide has a complex and diverse role in pain modulation [55].Our study reveals that xibalbins did not induce NO synthesis in RAW264.7 macrophages and is ineffective in preventing LPS-induced NO synthesis.Additionally, xibalbins showed no cytotoxic effects in RAW264.7 macrophages (Additional File 7: Figure S5 C).
Finally, we analyzed the effects of xibalbins on the leukocyte adhesion to the vascular endothelium, which is a critical step in the inflammatory response of inflamed tissues.Xibalbins were analyzed for their ability to alter the adhesion of human monocytic (THP-1) cells onto a TNF-activated endothelial cell monolayer (Additional File 5: Figure S4).Thus, we do not find xibalbins to have an impact on Ca 2+ , cAMP, and NO signaling in the cell types analyzed.They do not interfere with the adhesion of leukocytes to endothelial cells.All individual values for these experiments are given in Additional File 8: Table S3.

Diversity and evolutionary origins of ICK-like xibalbins
To explore the diversity and evolutionary origins of all ICK-like xibalbins, we aligned all full protein sequences from the five remipede species (see Fig. 2) with published ICK peptide sequences from arthropods, see "Methods".These complementary arthropod sequences include confirmed and predicted ICK peptides from venomous and non-venomous crustaceans, insects, myriapods, and chelicerates.The additional sequences were acquired from two studies that collated ICK peptides from pancrustaceans and arthropods [27,56] to reconstruct a maximum likelihood-based phylogenetic tree; see   Table S2 for individual data values "Methods".Although our topology remains unresolved in some, especially deeper nodes, we can infer important insights related to the evolution of xibalbins.
Our findings demonstrate that xib 13 sequences, which are present in all five remipedes, are located in a wellsupported clade (88 fast bootstraps) indicating a remipede-specific family that clusters with sequences from the notostracan crustacean Triops and basal hexapods that display a similar eight-cysteine scaffold akin to xib 13 (see Fig. 9).Xib 1 sequences (with their eight-cysteine scaffold similar to xib 13 ) constitute compared to xib 13 a more distinct, strongly supported clade (97 fast bootstraps); see Fig. 9.
Surprisingly, our phylogenetic analysis demonstrates that xib 2 , the ten-cysteine scaffolded xibalbin limited to X. tulumensis, emerges from the eight-cysteine scaffolded xib 1 by an apparent gene duplication.The result is an ancestral, transitional xib 2 sequence (c149924g1i1) with only eight and not ten cysteines which is highly similar in its primary sequence to the other xib 2 family members; see Fig. 9. Finally, we show that xib 3 sequences are closer related to six-cysteine scaffold ICKs from hexapods which indicates a possible six-cysteine variant that occurred in the ancestor of remipedes and hexapods.This result is in line with the findings of Maxwell et al. [56], who propose that xib 3 derives from a domain duplication of an ancestral six-cysteine precursor that has since been lost in X. tulumensis.The findings in our phylogenetic analyses, especially the origin of xib 2 from xib 1 , are as well supported in a CLANS analysis using pairwise sequence similarity clustering; see "Methods" and Additional File 10: Figure S6.

Machine learning analysis largely corroborates the phylogenetic analysis
To complement our phylogenetic analysis, we employed a novel machine learning (ML) method that constructs a multidimensional space of ICK relationships.This approach utilizes protein language models to generate a 1024-dimensional representation of proteins, known as "protein embeddings" [57].These embeddings capture similarities based on the model's understanding of protein structure and function, similar to how natural language processing understands text and predicts the probability of words appearing in a specific order.This method considers not only the sequence of amino acids but also their positions and interactions, even those separated by longer stretches, and captures nuances in structural and functional properties that are not apparent in direct sequence comparisons.It thus recognizes evolutionary relationships and functional classes of proteins without relying on "traditional" sequence alignment techniques that are based on positional homology; hence, it is termed "sequence independent".We have to note though that the actual evolutionary process is more complex than what can be shown in visual representations due to the vast number of possible functional ICKs and the constraints of each lineage's inheritance.For visualization purposes, we condensed this complex space into 3D, with 2D representations used in our figures (Fig. 10).
Our results largely corroborate the phylogenetic findings.Notably, xib 13 sequences appear as the most ancestral within the remipede-specific cluster, given that xib 13 represents a subfamily within a larger arthropod ICK subgroup.Furthermore, other remipede ICKs are positioned as close neighbors to xib 13 , reinforcing this interpretation.
Regarding xib 2 , our analysis revealed an intriguing bifurcation.While one form of xib 2 clusters with other arthropod ICKs, its isolated positioning casts doubt on the significance of these associations.Interestingly, that sequence is the transitional xib 2 sequence with only eight cysteines, which could mean that it exhibits as well a different or intermediate function that is more similar to xib 13 resulting in this clustering pattern.In contrast, another variant of xib 2 with ten cysteines is near the xib 1 cluster, forming a distinct and tight grouping.This observation provides strong support for the close relationship between xib 1 and xib 2 as suggested by our phylogenetic analysis.
Lastly, the xib 3 sequences displayed a clear clustering pattern by domain.Domain1 sequences of xib 3 were found in close vicinity to a dense cluster of centipede ICKs.This arrangement suggests a potential evolutionary trajectory for xib 3 , likely originating from a centipede ICK and undergoing subsequent domain duplication.The tree was calculated in IQTREE (-m MFP -alrt 25,000 -B 25000 -bnni -T Auto); for further details, see "Methods".We summarized several nodes for better visualization (triangles); see the full tree file (Additional File 9: Figure S6) (See figure on next page.)Fig. 10 Machine learning generated protein space representations of xibalbins correspond with gene phylogeny-based grouping.All proteins corresponding with the sequences used to reconstruct the phylogenetic tree are color coded according to their major taxon clade (A) that are also indicated in the phylogenetic tree (Fig. 9) and additionally labeled according to their cysteine scaffold (B).See "Methods" for further details

Discussion
We discuss here predominantly the examined activities of the xibalbin variants in the context of their potential pharmacological application and give first insight on their origins and evolution.

Ion channel activity
Among the tested xibalbin variants, xib 13 exhibited the most significant inhibition rates on voltage-gated potassium (Kv) and sodium (Nav) channels (Fig. 6).Intriguingly, despite their broad effects on Kv and Nav channels, none of the xibalbin variants affected T-type voltagegated calcium channels (Cav3.x);see Fig. 5.Other examples of toxins acting on both sodium and potassium channels come from different species.For example, Nc1a, derived from Nephila clavata spider venom, acts on Nav and Kv channels in cockroach dorsal unpaired median neurons [58].Similarly, HCTx from Heteractis crispa sea anemone venom shows remarkable target promiscuity [59,60].
Both ion channel families (Kv and Nav) were inhibited by xib 1 and xib 13 with a preference towards voltage-gated potassium channels and a higher percentage of inhibition when compared to voltage-gated sodium channels.Xib 1 and xib 13 inhibit especially Kv1.1 channels which are important in controlling neuronal excitability as they are abundantly expressed in the nervous system [61].In certain neuroinflammatory diseases such as multiple sclerosis and spinal cord injury, Kv1.1 channels show an altered axonal localization at the juxtaparanodal sites and hereby prevent electrical conduction along the neurons.It is reported that inhibiting Kv1.1 channels results in a reduction of pathological manifestations [62,63].
In addition, xib 2a and xib 2b interacted primarily with Kv channels (Kv1.1,Kv1.2, Kv1.3, and Kv1.6) with only Nav1.4 being weakly inhibited by xib 2a .The higher and more specific inhibition of Kv1.6 compared to other Kv channels by both Xib 2 variants has direct application potential.Elevated Kv1.6 channel expression is known to be associated with pathological neuronal conduction in epileptic variants of epilepsy.The challenge so far has been that toxins that act on Kv1.6 channels also crossreact with other Kv channel types [64].The characteristics of xib 2 suggest that this toxin, perhaps with some synthetic modifications, could be of interest for applied studies of Kv1.6.From a biological perspective, xib 2 is among the most highly expressed venom components in X. tulumensis, which implies its functional importance linked to the venom biology that remains, however, speculative.Interestingly, similar Kv channels are affected by all xibalbin variants (with the difference that no and only one Nav channel is affected by xib 2a and xib 2b ), despite their rather different primary sequences.A possible explanation for this phenomenon could be the close phylogenetic relationship between xib 1 and xib 2 .We will discuss later that xib 2 originates from xib 1, which could suggest that xib 2 might have been adapted to increase the quantity of Kv-inhibiting toxins in remipede venom.
Interestingly, no xibablin showed inhibitory activity on Shaker, an insect Kv channel from Drosophila melanogaster; only mammalian voltage-gated potassium channels were affected.On the other hand, besides the activity recorded on mammalian voltage-gated sodium channels, BgNav, an insect Nav from the cockroach Blattella germanica, was targeted by Xib 1 and Xib 13 , being the most affected Nav channel by the latter.
It was recently shown that the double ICK domainlike xib 3 (which was not part of this study) targets RyR channels and promotes calcium release [56].This suggests that calcium channel targeting is complementarily accomplished by xib 3 .Based on this new data, we speculate that the activities on ion channels might be divided among the xibalbins and that they act synergistically: xib 1 , xib 13 (on Kv and Nav) xib 2 (on Nav), and xib 3 (on RyR).It is important to note though that the folded structures of all of the variants that are tested so far are approximations of the naturally secreted and folded xibalbins, providing first insight into their likely bioactivity.Obtaining crude venom to fractionate the toxins and reveal their natural conformation and bioactivity remains a challenge due to the rarity of remipede individuals in their remote habitats.Although we see no indication of diminished activity, we must note that we could not perform washout or concentration-response experiments to safely exclude artifacts.Nonetheless, the activity of the synthesized variants shows some promising activities for applied research that should be investigated further.

Nociceptive neuron activity by xib 1 and xib 13
Sensory neurons exposed to xibalbins did not change in cell number, which indicates the absence of direct cytotoxic effects on cells (Additional File 7: Figure S5).Size and UCHL1 distribution of cells also remained the same among the tested groups suggesting there is no subgroup specific or neural cell death related to our xibalbin variants (Additional File 4: Figure S3).However, this is in line with other studies that test different ICK peptides such as GTx1-15, which has also been shown not to exert cytotoxicity in human cell lines even at high concentrations [65].The recorded effects on ion channels regulating neuronal membrane potentials makes it interesting to look into the effect of the xibalbin variants on neurons, and we tested their activity on cultured sensory neurons.We observed increased PKA-II and pErk1/2 activity by xib 1 and xib 13 but not by xib 2 .This is in line with our data on the ion channel activity because the inhibition of Kv channels by xib 1 and xib 13 can prolong the action potential of the neuron, which then leads to activation of PKA-II and Erk1/2.Thouta et al. [66] reported that mice lacking the Kv1.1 coding gene show higher neuronal hyperexcitability, which is in agreement with our data showing that a more potent Kv1.1 inhibitor is a stronger activator of PKA-II and Erk1/2 (Fig. 8C-F and Fig. 8C-D).Whether such an activation is solely driven by the effects on membrane potential-regulating ion channels is not clear.There may also be an additional metabotropic activity of xib 1 and xib 13 similar to toxins such as α-latrotoxin or α-Bungarotoxin [66,67].
We evaluated then the sensitization of rat DRGs by xib 1 and xib 13 .Analysis of the size of cells reveals that mainly small-sized cells are responding to xibalbin toxins meaning that mostly nociceptive neurons are their predominant target (Fig. 8 G, H).As PKA and ERK activity has been characterized to result in pain behavior [68][69][70][71], our results imply that xib 1 and xib 13 are able to induce nociceptive neuron activity and are thus potential candidates for pain treatment.It should be noted that the concentrations used were higher than those used in the electrophysiological tests and caution should be exercised in making direct comparisons and biological interpretations.We can only speculate if this activity might reflect predatory or defensive functions of the natural xib 1 and xib 13.There is evidence that GPCR and RTK signaling already emerged in unicellular ancestors of metazoan and bilaterian species [72,73].In higher organisms, for example in Aplysia, GPCR signaling activating the cAMP/PKA pathway has been shown to regulate nociception [74] and the contribution of PKA and ERK to nociceptor hyperexcitability has been demonstrated [75,76].These studies suggest that the tested rat DRGs could theoretically reflect also possible activity in prey or predators of X. tulumensis [77][78][79].

Evolutionary perspective on xibalbins
Our phylogenetic analysis illuminates first the possible origins of xib 13 , which is present in all five remipedes in a well-supported clade.Closely related to xib 13 are sequences from early hexapod lineages, indicating that an older xib 13 variant existed already in the common ancestor of hexapods and remipedes.More distantly related sequences are from non-venomous crustaceans (malacostracans, notostracans) and chelicerates; however, our topology is not sufficiently resolved in the deeper, more ancestral nodes to draw here further conclusions.Therefore, we can formulate two possible hypotheses on the deeper origin of xib 13 : either a common ancestral variant already existed in the ancestor of pancrustaceans and chelicerates, or xib 13 -like proteins evolved convergently in pancrustaceans and chelicerates.Given the clustering of protein embeddings of xib 13 with other arthropodan ICKs in the machine learning analysis of protein space, the former scenario is more likely.
Xib 1 sequences occur in all remipedes except G. frondosus.Nevertheless, it appears that xib 1 is a common venom component in remipedes that is more unique to this group, which is reflected in the highly supported clade with remipede-only sequences (Fig. 9) and the protein space clustering (Fig. 10).Given the highly similar eightcysteine scaffold of xib 1 and xib 13 , our original hypothesis was that these two peptides are the result of a gene duplication that is followed by subsequent adaptation and high divergence of the sequences within the remipede lineage.However, both phylogenetic and machine learning results reveal separated monophyletic clades for both and thus rather support a separate origin probably from older gene duplication (Figs. 9 and 10).Xib 1 experienced conservatively estimated at least two duplication events as seen in Fig. 9 (two different variants in Pleomothra); however, without genome data, this is difficult to interpret.
Surprising is our finding that one duplication event of xib 1 within Xibalbanus leads to the origin of xib 2 with an ancestral transitional form (c149924g1i2) that has still eight cysteine patterns similar to xib 1 ; see Figs. 2, 9, and 10.The primary sequence of this ancestral xib 2 is highly similar to the other xib 2 members but shorter.To draw a certain evolutionary scenario for this finding is somewhat challenging.In general, two novel cysteines could trivially appear by coupled point mutations in the sequence as it was shown in other venom proteins.For example, snake venom phospholipases A2 often evolve novel cysteines for polymerization, while three-finger toxins have evolved a novel inner bond [80,81].However, the caveat of this hypothesis is the shorter sequence.A less parsimonious hypothesis is that the duplication of the c149924g1i1 xib 1 gene variant resulted in domain gain that added a sequence stretch in which two further cysteines evolved later.However, without genomic data, any hypothesis remains quite speculative also because the genomic processes that lead to the evolution of ICK peptides and their cysteine scaffolds are not yet studied in detail.Nevertheless, the origin of xib 2 must underlie an important evolutionary constraint.It is a unique component of the venom gland secretion, implying that it is recruited and expressed exclusively in the venom system [27].However, its high expression (fourth highest expressed venom component in X. tulumensis) suggests that it may be functionally more important than xib 13 and xib 1 .The reasons for this, such as an adaptation to prey or predator, remain to be uncovered.
Less expressed but also Xibalbanus-specific is xib 3 , which was not tested in this study but by Maxwell et al. [56].We support their findings that the two six-cysteine scaffold domains of this double (domain) knottin are closely related to six-cysteine ICK sequences from insects (Figs. 10 and 11).Maxwell and colleagues argue that they evolved by duplication from an ancestral variant that is lost in Xibalbanus.We do not reject this hypothesis, however, given our results, it also implies the loss of sixcysteine variants in all other remipedes and crustaceans.However, our topology is not fully resolved in this part and we refrain from further conclusions.
To better understand the origin and remarkable convergent evolution of ICKs as highly adaptive peptides among major arthropod groups such as insects, crustaceans, and chelicerates, an extended analysis incorporating new proteo-transcriptome venom data from additional remipede species would be advantageous.It is equally crucial to obtain genome data from these arthropods, as recent research has demonstrated that solely utilizing proteo-transcriptome data in addressing genespecific inquiries including gene origin and duplication can be flawed [23,24,82,83].Xibalbins are fundamental ICK-like peptides from an evolutionary viewpoint, providing insights into the emergence and evolution of this diverse toxin category, primarily in remipedes and hexapods, but also among other arthropods (Figs. 9, 10, and 11).

Conclusions
Although structure and function of naturally occurring single domain xibalbin fractions from remipede venom remain be to explored, our results based on synthesized variants support first functional insights that they affect predominantly potassium channels in combination with induced pain.Our results also show how important genome data will be to untangle the origin of xibalbins as part of the ICK-like venom protein family.As remipedes are the sister group to insects, the future identification of their ICK-like venom genes, along with an initial depiction of their conceivable mode of action and their phylogeny, is a crucial stepping stone to better understand the Sequences with a similar cysteine scaffold (number of cysteins given in brackets) found in insects, other crustaceans, and remaining arthropods are illustrated as well in the same shape but different taxon-specific colors (chelicerates = green, myriapods = brown, insects = gray).Six cysteine scaffolds that are not found in remipedes or crustaceans are shown in the shape of a rhombus.Possible ancestral variants are indicated with a question mark and numbers on top for different alternative hypotheses.The phylogeny is taken from the most recent phylogenomic analysis of pancrustaceans [34] function and evolution of this hyper-diverse toxin group in remipedes and pancrustaceans.

Prediction of mature sequences of xibalbin variants and chemical synthesis
From the previous proteo-transcriptomic venom analysis of X. tulumensis xib 1 and xib 2 transcripts that are highly expressed and supported on proteome level by MALDI mass spectrometry were used as core sequences for this study [27,84].The two xib 1 and xib transcripts (c27367_g1_i2_2 and c29168_g1_i2_VG_0) show high sequence similarity with known ∂-omegatoxin, which are knottins found in funnel web spiders with an ICK motif based on 8C residues [26,85].For xib 2 , only one higher expressed transcript (c29772_g1_i1_trinity_VG_ASS_5) was validated on the proteome level.Xib 2 peptides feature an unusual 10C scaffold knottin-like sequence.Currently, no detailed analysis and structural conformation is known for single-domain xibalbins.We predicted the mature sequences of xib 1 and xib 13 by aligning these proteins with a representative highly similar structurally known ∂-omega toxin peptide from spider venom [85] that was already included in von Reumont et al. (2017) [27,84] using EMBOSS secondary structure and cleavage site-packages (V1) within the software package Geneious Prime (2022.2.2); see Fig. 2. The domain area of xibalbin 2 was predicted by aligning all transcript variants recovered in the proteo-transcriptomics study [27] and by using the EMBOSS secondary structure and cleavage site packages (V1) within the software package Geneious Prime (2022.2.2).We test here two variations (different cleavage sites) of these sequences with differing lengths of the mature protein; see Fig. 2.

Evaluation of disulfide bonds by mass spectrometry
All chemically synthesized peptides were ordered at Vivitide, Gardner, MA, USA, performing the in-house solid-state synthesis by a standard automated peptide synthesizer.A non-directed refolding process using reduced/oxidized glutathione established thermodynamically stable disulfide bridges.In-house HPLC was run to evaluate the purity for all compounds with the following results: xib 1 = 88.0%,xib 12 = 84.9%,xib 2a = 99.9%, and xib 2b = 74.4% (Additional File 1: Figure S1).To verify sequence and structure, we sequenced all final xibalbin variants by bottom-up mass spectrometry analysis as digested peptides and as intact peptides (top-down).All xibalbin peptide sequences and purity (background noise of non-target fragments) were confirmed by tandem mass spectrometry (MS/MS) of the most abundant peptide fragments (Additional File 11: Figure S8).
The lyophilized undigested peptides were dissolved in ultrapure water (10 mg/mL), and either directly set to final concentration (2 mg/mL) with acidified ammonium bicarbonate buffer (25 mM ABC, 0.1% TFA, pH 7.8) for intact peptide mass analysis, or previously reduced (10 mM dithiothreitol), alkylated (20 mM iodoacetamide), and finally digested (15 ng/µL trypsin, Promega, Madison, United States) for bottom-up peptide analysis.In addition, native xibalbin variants were treated by either reduction (10 mM DTT) and alkylation (20 mM IAC) or direct digestion (15 ng/µL trypsin, Promega, Madison, USA).The matrix was prepared using a saturated stock solution of α-cyano-4-hydroxycinnamic acid (CHCA) in acetonitrile/ultrapure water (70% acetonitrile with 0.1% trifluoroacetic acid (TFA)).First, samples were spotted by dried droplet onto the ground steel target plate and matrix solution added on top.Mass analyses were performed on a MALDI-ToF/ToF mass spectrometer (ultra-fleXtreme, Bruker Daltonics, Bremen, Germany) with the following operating settings: ion source 1 = 20.00kV, ion source 2 = 17.75 kV, lens voltage = 8.00 kV, reflector voltage = 20.80 kV, optimized pulsed ion extraction time = 170 ns, matrix suppression = 600 Da, sample rate 5.00 GS/s, analog offset 68.90 mV, and positive reflectron (bottom-up peptide identification) or linear (intact peptide identification) mode.Laser strength and pulse ion extraction time were initially optimized and performance was calibrated using the calibrant peptide standard mixture.Peptides from mass spectra of in-solution digest samples were manually matched against the xibalbin amino acid sequences.
Identification of masses for refolded xib  S9).However, we have to note that the exact disulfide connectivity was not determined and that the synthetic peptides may not represent the natural peptides.All mass spectrometry proteomics data (.fid files) have been deposited via the MassIVE partner repository [86] under project name "Non-cytotoxic xibalbin ICK variants from remipede crustaceans" with the data set identifier MSV000091677 [87].
Electrophysiological measurements were performed at room temperature (18-22 °C) using the two-electrode voltage clamp (TEVC) technique.Data were obtained using a GeneClamp 500 amplifier (Axon Instruments, USA), and Clampex9 software (Axon Instruments), responsible for data acquisition and storage.Glass micropipettes were produced using glass capillaries (borosilicate WPI 1B120-6) and drawn in a WPI (World Precision Instruments, USA) manual stretcher.The bath and perfusion solutions were either the previously described ND96 (Na v and K v channels) or calcium-free ND96 supplemented with 10 mM BaCl 2 (Cav channels).
Whole-cell currents of oocytes were recorded 1 to 3 days after injection.Current and voltage electrodes were filled with 3 M KCl and their resistance was adjusted from 0.7 to 2.0 MΩ.Currents were sampled at 20 kHz (Na v channels) and 10 kHz (K v and Ca v channels) and filtered using a four-pole Bessel low-pass Bessel filter, at 1 kHz for sodium, and 500 MHz for potassium and calcium, except for the hERG ion channel, in which the currents were filtered at 1 kHz.Leak subtraction was performed using a -P/4 protocol.K v 1.x currents were evoked by 500 ms depolarizations to 0 mV followed by a 500-ms pulse to − 50 mV, from a holding potential of − 90 mV.K v 2.1, K v 3.1, and K v 4.2 currents were elicited by 500 ms pulses to + 20 mV from a holding potential of − 90 mV.Current traces of Kv10.1 were elicited by 2 s depolarization to 0 mV, from a holding potential of − 90 mV.Current traces of hERG1 channel were elicited by applying a + 40 mV prepulse for 2 s followed by a step of − 120 mV for 2 s.Sodium current traces were evoked by a 100 ms depolarization to 0 mV, from a holding potential of − 90 mV.The current-voltage (IV) relationships were determined by 100-ms step depolarizations between − 90 and + 40 mV, using 5 mV increments.For Ca v channels, current traces were elicited by 700 ms depolarizations to − 20 mV from a holding potential of − 90 mV.Current values were expressed as means ± SEM of at least three independent experiments.Differences in ionic currents between control and sample conditions were compared by one-way ANOVA, followed by Dunnet multiple comparisons test.The shifts in V half of activation and inactivation of Nav channels were compared by one-way ANOVA, followed by Tukey's multiple comparisons test.Differences were considered statistically significant when p < 0.01.
DRGs were stimulated after overnight incubation.Compounds were prepared as tenfold concentrated stock solutions, diluted in PBS (PAA, cat# H15-002), in v-bottom plates; 50 µL media from culture wells were mixed with the 12.5 µL stock solutions, and 50 µL added back to the respective wells.Stimulation was performed with automated multichannel pipette and cells were kept in heated block during the stimulation.Cells were then fixed for 10 min at RT with paraformaldehyde (final concentration: 4%, Cat# 1.04005) at desired time points.Fixed cells were washed twice with PBS.Following blocking and permeabilization (2% normal goat serum (Dianova, Hamburg, Germany, #005-000-121), 1% BSA, 0.1% Triton X-100 (Roth, Karlsruhe, Germany, #3051.2),0.05% Tween 20 (Sigma-Aldrich, #P9416)) for 1 h at RT, respective primary antibodies diluted in 1%BSA in PBS was added to the cells and incubated for and overnight at 4 °C.After that, cells were washed three times with PBS (10 min) and secondary antibodies (1:1000, fluorescently labeled) and DAPI (50 ng/ml-1) for 1 h at RT in dark.Finally, cells were washed three times with PBS (10 min) and wells were filled with PBS, sealed, and kept at 4 °C until scanning.
Stained cells were scanned with a CX7-LZR (Thermo Fisher Scientific) HCI system.Images were acquired with a 10 × objective and analyzed using the cellomics software package (Thermo Fisher Scientific).UCHL1 channel was used as a marker to identify neurons.Object selection was further based on the following criteria: 120-6000 µm 2 ; circularity: 1-2; length-to-width ratio: 1-2; average intensity: 250-2000; and total intensity: 6 × 104 to 5 × 106.The resulting objects were quantified for average object intensity in all other color channels.Untreated wells were used for normalization and compensation was performed for minimizing spill over between channels.All analyses were conducted using R programming language and RStudio as integrated development environment (IDE).One-way ANOVA, Dunnett's test post hoc was performed to evaluate statistical significance between groups.The difference between the two means (D) divided by the standard error of that difference (computed from all the data): q = D/SED.
All sequences of xib 1 , xib 2, and xib 13 alignments (Additional File 21: Data S10, Additional File 22: Data S11, Additional File 23: Data S12) were combined with known ICK toxins and highly similar sequences from non-venomous arthropods available in UniProt combining the sequences used in Maxwell et al. [56,139] (with separated first and second double ICK domains of xib 3 ) and von Reumont et al. [27]; see Additional File 24: Data S13.Signal peptide, propeptide, and mature regions were separately aligned for all sequences with optimization strategy for one domain using Mafft-L-INS-I [140] and then concatenated.The phylogenetic tree was reconstructed with IQ-TREE2 [141,142] on 56 cores using settings for rapid bootstraps with integrated model fitting and branch length optimization (MFP, -B 25000, -bnni, -T 56).The original tree (Additional File 9: Data S1) was condensed for Fig. 9 by collapsing all nodes below a support value of 50.The complementary CLANS analysis was performed with standard settings using the Java version 29.05.2012 and 81,987 rounds [143].

Construction of ICK embedding protein language model space
We leveraged modern advances in machine learning, in particular-in natural language models adopted to work with proteins-protein language models (or pLMs) [144].These models have been successfully used to create protein space for various datasets, including our own work [23,81].It was shown that distance in embedding space correlates with protein function and can be used as an orthogonal signal for clustering proteins into functional families [145].
Here, we used the pLM ProtT5-XL-UniRef50 [144] (in the following ProtT5) to create fixed-length vector representations for each protein sequence (per-protein embeddings) irrespective of its length.To achieve that, we first created individual vector representations for each residue in a protein and then averaged over all residue embeddings in a protein to derive fixed-length vector representations for single proteins (per-protein embedding) irrespective of a protein's length.As ProtT5 was only trained on unlabeled protein sequences and no supervised training or fine-tuning was performed, there is no risk of information leakage or overfitting to a certain class or label.As a result, every protein was represented as 1024-dimensional per-protein embeddings.Those high-dimensional representations were projected to 3D using UMAP (n_neighbors = 25, min_dist = 0.5, random_state = 42, n_components = 3) and colored according to their respective group to allow for visual analysis.Embeddings were created using the bio_embeddings package [145].Interactive 3D plots of protein spaces are given in Additional File 25: Data S14 (proteins labeled according to taxa clades and protein families) and Additional File 26: Data S15 (proteins labeled according to cysteine scaffold) and were reconstructed using the algorithm deposited on github: https:// github.com/ Rostl ab/ RostS pace.

Fig. 1 Fig. 2 Fig. 2 (
Fig.1Overview of published data used for our study to investigate bioactivity of the higher expressed ICK-like peptides.A Habitus of an adult remipede X. tulumensis.B The anatomy of the venom system as described in von Reumont et al.[26] is illustrated by blending a synchrotron-based microcomputer tomography reconstruction into a light microscope picture.All components of the venom apparatus (venom gland, venom duct, and venom reservoir) are shown in magenta.C Proteo-transcriptome data used as a base for our study is illustrated in a modified graphic.Only proteome-verified transcripts of the venom profile described in von Reumont et al.[27] are shown with their expression levels on the x-axis.The three single ICK domain protein families xibalbin 1, xibalbin 2, and xibalbin 13 are highlighted.The fourth family of knottin-like proteins, xibalbin 3 , which is a double ICK-like domain peptide and not the subject of this study, is indicated in gray

Fig. 3
Fig. 3 Electrophysiological characterization of xib 1 (A), xib 13 (B), xib 2a (C), and xib 2b (D), at 1 µM, on a panel of Kv channels.The black lines represent the control condition, while the red lines indicate the current obtained after the addition of each peptide.The dotted lines represent the 0 current level.The graphs illustrate the effects obtained in a series of at least three independent experiments (n ≥ 3); see "Methods" and Additional File 2: Table S1 for individual data values

Fig. 4
Fig. 4 Electrophysiological characterization of xib 1 (A), xib 13 (B), xib 2a (C), and xib 2b (D), at 1 µM, on a panel of Nav channels.The black lines represent the control condition, while the red lines indicate the current obtained after the addition of each peptide.The dotted lines represent the 0 current level.The graphs illustrate the effects obtained in a series of at least three independent experiments (n ≥ 3); see "Methods" and Additional File 2: Table S1 for individual data values

Fig. 5
Fig. 5 Electrophysiological characterization of xib 1 (A), xib 13, (B), xib 2a (C), and xib 2b (D), at 1 µM, on Cav3.1, Cav3.2, and Cav3.3.The black lines represent the control condition, while the red lines indicate the current obtained after the addition of each peptide.The dotted lines represent the 0 current level.The graphs illustrate the effects obtained in a series of at least three independent experiments (n ≥ 3); see "Methods" for details

Fig. 7
Fig. 7 Activation (closed symbols) and steady-state inactivation (open symbols) curves in control (black) and toxin (red) condition for each Nav channel in the presence of 1 µM of xib 1 (A), xib 13 (B), and xib 2a (C).(n ≥ 3) ± SEM; SEM standard error of the mean.Statistics were calculated using one-way ANOVA, followed by Tukey's multiple comparisons test.Differences were considered statistically significant when p < 0.01; see Tables1 and 2. See "Methods" and Additional File 2: TableS1for individual data values

Fig. 8 A
Fig. 8 A Representative images of rat DRG neurons stimulated with PBS (Ctrl) and xib 1b (0.8 µg/µL).Cells were stained for UCHL1, pRII, pERK1/2, and Hoechst followed by fixation and staining with standard immunocytochemistry protocols.Green-encircled neurons indicate automatically selected objects.Scale bar: 100 µm.B Mean intensities of UCHL1, pRII, and pERK1/2.The same UCHL1 intensities were observed among tested groups.Forskolin (Fsk), an activator of adenylyl cyclase, was used as a positive control.Fsk at 6 µM concentration induced an increase in pRII and pERK1/2 intensities at 5 and 30 min.Number of analyzed cells per replica: 750 ± 200.Eight independent replicas.C-F Concentration responses of xibalbin toxins (0.005 to 0.8 µg/µL) and mean intensities of pRII and pERK1/2 at 5 min and 30 min.G-H Size versus pRII and pErk1/2 intensities of cells treated with 0.8 µg/µL of xib 1 and xib 13 .Small-sized cells were responding to xibalbins.Statistical significance was assessed using one-way ANOVA with Dunnett's post hoc test.Data represent mean ± S.E.M (standard error of the mean), n = 4, see "Methods" for details and Additional File 6: Table S2 for individual data values

Fig. 9
Fig.9 Phylogeny of ICK-like proteins in major arthropod groups and knottin-like xibalbins in remipedes.Nodes for which both support values are below 50 are shown in multifurcation, the first number gives the SH-aLRT support in percent, and the second number the fast bootstrap support.The relevant node values for xibalbin protein clades are printed in bold blue.Chelicerates are green, myriapods brown, and crustaceans in blue, while hexapods are colored in black.Major groups are indicated.The cysteine scaffold of ICK-like sequences is shown in white in the dark blue bars.The tree was calculated in IQTREE (-m MFP -alrt 25,000 -B 25000 -bnni -T Auto); for further details, see "Methods".We summarized several nodes for better visualization (triangles); see the full tree file (Additional File 9: FigureS6)

Fig. 11
Fig. 11 Synopsis of the evolution of xibalbins.Xib 1 (yellow circle), xib 13 (red circle), xib 2 (blue triangle), and xib 3 (blue square) in remipedes are depicted.Sequences with a similar cysteine scaffold (number of cysteins given in brackets) found in insects, other crustaceans, and remaining arthropods are illustrated as well in the same shape but different taxon-specific colors (chelicerates = green, myriapods = brown, insects = gray).Six cysteine scaffolds that are not found in remipedes or crustaceans are shown in the shape of a rhombus.Possible ancestral variants are indicated with a question mark and numbers on top for different alternative hypotheses.The phylogeny is taken from the most recent phylogenomic analysis of pancrustaceans[34]

Table 1
Shifts in current-voltage relationships of voltage-gated sodium channels in control (ND96) and in the presence of 1 µM xib 1 .**p< 0.01, Fig.7A